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Abstract 

Correlation across transmit antennas, in multiple antenna systems (MIMO), has been studied in 
various scenarios and has been shown to be detrimental or provide benefits depending on the particular 
system and underlying assumptions. In this paper, we investigate the effect of transmit correlation on 
the capacity of the Gaussian MIMO broadcast channel (BC), with a particular interest in the large-scale 
array (or massive MIMO) regime. To this end, we introduce a new type of diversity, referred to as 
transmit correlation diversity, which captures the fact that the channel vectors of different users may 
have different, and often nearly mutually orthogonal, large-scale channel eigen-directions. In particular, 
when taking the cost of downlink training properly into account, transmit correlation diversity can yield 
significant capacity gains in all regimes of interest. Our analysis shows that the system multiplexing 
gain can be increased by a factor up to [M/r\, where M is the number of antennas and r < M is 
the common rank of the users transmit correlation matrices, with respect to standard schemes that are 
agnostic of the transmit correlation and treat the channels as if they were isotropically distributed. Thus, 
this new form of diversity reveals itself as a valuable “new resource” in multiuser communications. 
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I. Introduction 

Traditionally, channel fading had been considered as a harmful source to combat with transmit 
or receive diversity. However, since the seminal work of [I], |2], independent fading in multiple- 
antenna (MIMO) systems has been shown to provide large capacity gains, thanks to the fact that 
the number of degrees of freedom of such MIMO channels grows with the minimum of the 
transmit and receiving antennas. Focusing on downlink communication, we consider the case 
where the transmitter (base station) is equipped with a number of antennas, and the receivers 
(users) are equipped with a single antennaQ In a typical system geometry where the base station 
array is mounted on a tower or on a relatively tall building, and the propagation between the 
base station and the users occurs along clusters of scatterers that are seen from the base station 
on a narrow angular spread, the coefficients of the channel vector describing the propagation 
between the base station array and a given user are correlated Gaussian random variables. In 
contrast, the channel vectors of different users, which are physically separated by multiples 
of the wavelength^] are mutually statistically independent. Spatially correlated MIMO channels 
have been well characterized for a variety of transmit correlation models 0-Q. Traditionally, 
transmit correlation has been considered to be a detrimental source, thereby incurring power loss 
at high signal-to-noise ratio (SNR) (e.g., 0 ). Some exceptions where transmit correlation helps 
capacity are the case of low SNR [j5j, [|6j, where the capacity-achieving input covariance is non¬ 
isotropic, and the case where channel state information (CSI) is not available at all |8|, for which 
knowing the statistics of the channel may effectively help. The impact of transmit correlation on 
the ergodic capacity is much less known in the multiuser context, albeit the capacity region of 
the Gaussian MIMO BC with perfect CSI at both transmitter and receivers is fully understood 
[9j irrespectively of the channel statistics. The work of [ 101 extended the sum-rate scaling result 


1 In practice, a user device may be equipped with multiple antennas, which are then combined in order to form a beamforming 
pattern, i.e., a directional antenna, to achieved beamforming gain and/or inter-cell interference rejection. What matters for the 
analysis in this paper is that each user receives a single data stream, i.e., even though the device is equipped with multiple 
antennas, their output is combined and demodulated as a single stream. 

Tor example, for a typical carrier frequency between 2GHz and 5GHz, the channel wavelength is between 15 and 6cm. 
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of [11] to the special case where all users have a common channel covariance matrix, and 
concluded that transmit correlation has a detrimental impact on the sum capacity of multiuser 
MIMO (MU-MIMO) systems. 

A different line of works has pointed out that transmit correlation can be in fact advantageous 
for MU-MIMO systems, with respect to aspects such as CSI feedback overhead, scheduling, and 
codebook design [T2|-[jl5]). The key observation is that, in a multiuser environment, there exist 
diverse transmit correlations across multiple users. Basically, different transmit correlations indi¬ 
cate different “large-scale” (or long-term) preferential directions of the user channels, depending 
on the geometry of local scatterers and transmit antenna spacing. Therefore, the diversity of 
transmit correlations can be leveraged in the multiuser communication framework. In order to 


fully exploit such effect, the authors in [161 introduced an optimistic condition by imposing a 
useful structure on channel covariances of users, referred to as tall unitary structure , for the 
MU-MIMO downlink. The resulting joint spatial division and multiplexing (JSDM) precoding 
strategy was extensively studied to show that the tall unitary condition holds asymptotically at 
least for the case of uniform linear array (ULA) with large number of antennas, whenever the user 
groups have scattering with disjoint angular support [ 17]. Further work along this line of research 
has considered: 1) the system capacity in the large system regime (both the number of antennas 


and the number of users grow to infinity at a fixed ratio) [18]; 2) schemes for opportunistic 
beamforming with probabilistic scheduling [19]; 3) the suitability of the JSDM framework for 


millimeter wave (mm-Wave) channels [20]; 4) the elimination of pilot contamination in multi¬ 
cell TDD systems where different cells share the same set of uplink pilot signals [21]; 5) how 
to design coordinated composite beamforming schemes where the knowledge of the transmit 


correlation can be used to improve the performance of multi-cell MU-MIMO networks [22], 


[231. 


In this paper, we coin the term transmit correlation diversity for the new type of diversity 
under the ideal structure, where users are partitioned into groups such that all users in the same 
group have the same channel covariance and the eigenspaces spanned by the channel covariances 
of groups are mutual orthogonal (or linearly independent for a weaker condition). Assuming this 
unitary condition and the symmetric case where all group covariances are of rank r < M, where 
M denotes the number of base station antennas, the number of degrees of transmit correlation 
diversity can be expressed as G = [M/r\. While previous related works paid great attention 
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to lower bounds on the performance gain achieved by exploiting transmit correlation, e.g., by 
focusing on the achievable sum-rate analysis of JSDM for large-scale MU-MIMO systems in 
in this work we rather focus on information-theoretic upper bounds in order to provide 
some new insights into the role of transmit correlation diversity with respect to the capacity of 
MIMO BCs. Specifically, we wish to answer to the following questions: In which regimes of the 
system parameters including M, the number of users K, G, and T c , where T c is the coherence 
time interval, can transmit correlation diversity be beneficial to the capacity? What are the 
upper bounds on its potential gain in the regimes of interest, compared to the capacity of the 
independent and identically distributed (i.i.d.) Rayleigh fading MIMO BC? 

Assuming perfect CSI, we first study the impact of transmit correlation diversity on the power 
gain (the parallel shift of capacity versus SNR curves, also known as power offset) of MIMO 
BC. The authors in [ji~8|, [19) characterized the asymptotic capacity behavior in the large number 
of users regime. In sharp contrast to [ |T0| , it turned out that transmit correlation diversity can 
achieve a sum-rate gain of up to MlogG (i.e., power gain of 31ogG dB). However, it was not 
fully understood why we could do better than the i.i.d. Rayleigh fading case in this regime. In 
addition, it was not known whether transmit correlation diversity can achieve capacity gains in 
other regimes of the system parameters, compared to the independent fading case. To this end, 
we need to investigate the impact of transmit correlation on the power gain of MIMO BC in 
various regimes. It turns out that transmit correlation diversity may be even harmful, especially 
when M is larger than K , under the idealized assumption of perfect CSI at the base station. 

Taking the cost of downlink training into consideration rather than assuming perfect CSI, the 
well-known limit on the sum rate of the i.i.d. Rayleigh block-fading MIMO channel immediately 
provide a cut-set upper bound, following from the work of Zheng and Tse [ |24| , Sec. V] (see also 
[25)), as already noticed in [17], [26]. Namely, the high-SNR capacity of the resulting pilot-aided 
systems is limited by M* id ( 1 — M* id /T c ) log SNR + 0(1), where M* id = min{M, K, |_T C /2J}. For 
typical cellular downlink systems with M small, where min {M,K} <C T c , the factor T c /2 does 
not significantly affect the system performance. However, in the large-scale array regime with 


M > T c , to which a great deal of attention has been paid since [27], this factor is shown to 
have a critical impact on the system performance. To be specific, no matter how large both 
M and K are, multiplexing gain is fundamentally saturated by T c /A when T c j 2 < min (M,K). 
Interestingly, this limit is obtained by letting only T c j 2 users send uplink pilot signals on the first 
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half of the coherence block, i.e., for Tj 2 dimensions, and using the remaining T c /2 dimensions 
to serve these users by spatial multiplexing. Thanks to the TDD reciprocity, M can be made 
arbitrarily large, thus going to the regime of massive MIMO. Notice that this scheme that uses 
half of the coherence block for uplink training and the other half for downlink data transmission 


is precisely what was provided as an LTE-motivated example in [27]. As a result of this upper 
bound, when both M and K are large, the coherence time T c becomes the system dimensionality 
bottleneck, which fundamentally limits the system multiplexing gain. Note that the above upper 


bound in [24] holds only for isotropically distributed channels. As a consequence, any scheme 
based on uplink training and TDD reciprocity that is agnostic of the channel statistics, i.e., which 
treats the channels as if they were independent isotropically distributed random vectors, must 
also obey to such bound. In contrast, in this work we shall show that this is not necessarily the 
case in spatially correlated fading channels, for which it is possible to take explicit advantage 
of the knowledge of the channel statistics (i.e., of the covariance matrix) in order to break the 
above dimension bottleneck. In particular, we find that the multiplexing gain can continue to 
grow as M and K increase, provided that the degrees of transmit correlations diversity are 
sufficiently large. Therefore, transmit correlation diversity is indeed beneficial to significantly 
increase the multiplexing gain of MU-MIMO systems, as well as the power gain in some regimes. 
By taking the CSI estimation (downlink training) overhead into account, we show that transmit 
correlation diversity is beneficial in all regimes of the system parameters, apart from the case 
where min {M,K} is too small. 

The remainder of this paper is organized as follows. Section [IT] describes the MU-MIMO 
downlink system model of interest and briefly reviews a key result of JSDM with the notion 
of transmit correlation diversity. In Section [TTT| we study the impact of transmit correlation to 
the power gain of MIMO BCs in several regimes of system parameters, assuming perfect CSI. 
Section [TV] investigates the impact of transmit correlation to the multiplexing gain of pilot-aided 
MU-MIMO systems, where the cost of downlink training is considered. We conclude this work 
in Section [V] 

Notation : A H and A, (A) denote the Hermitian transpose and the Ah eigenvalue (in descending 
order) of matrix A. tr(A) and \A\ denote the trace and the determinant of a square matrix A. I n 
denotes the nxn identity matrix. ||a|| denotes the ( 2 norm of vector a. We also use x ~ CAf(0; £) 
to indicates that x is a zero-mean complex circularly-symmetric Gaussian vector with covariance 
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£. Z + denotes the set of positive integers. The base of the logarithms used in this work is 2. 
Finally, let C sum (^4), where A is a subset of SNR, M, K, r, and G, denote the asymptotic sum 
capacity of MIMO BC when system parameters in the set A are sufficiently large. 


II. Preliminaries 

A. System Model 

Consider a MIMO BC (downlink) with M transmit antennas and K users equipped with a 
single antenna each. For spatial correlation between transmit antennas, we use the well-known 
Kronecker model [3], [4] (or separable correlation model) H = RW. where there is no receive 
correlation due to single-antenna users, the elements of W are i.i.d. ~ CAT A. 1), and R denote 
the deterministic transmit correlation matrices, respectively, assuming the wide-sense stationarity 
of the channels. The random matrix H follows the frequency-flat block-fading model for which 
it remains constant during the coherence time interval of T c but changes independently every 
interval. Most of our results in this paper remain valid in the more general unitary-independent¬ 
unitary model (for which see Q), since the elements of W are allowed to be independent 
nonidentically distributed to apply some well-known results of random matrix theory to be used 
in this paper. 

In this paper, we let R normalized as tr(R) = M without loss of generality for all users. By 
using the Karhunen-Loeve transform, the channel vector of a user can be expressed as 


h = UA?w 


( 1 ) 


where A is an r xr diagonal matrix whose elements are the non-zero eigenvalues of R. U <E C Mxr 
is a tall unitary matrix whose columns are the eigenvectors of R corresponding to the non-zero 
eigenvalues, i.e., R = UAU H , and w <G C rxl ~ CAT ( 0 , 1 ). 

Let H denote the M x K system channel matrix given by stacking the K users channel vectors 
h by columns. The signal vector received by the users is given by 

y = H H Vd + z = H h x + z (2) 

where V is the M xs precoding matrix with s the rank of the input covariance £ = Eja;x H ] (i,e., 
the total number of independent data streams), d is the s-dimensional transmitted data symbol 
vector such that the transmit signal vector is given by a: = Vd, and z ~ CJ\f(0,I) is the Gaussian 
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noise at the receivers. The system has the total power constraint such that tr(£) < P, where P 
implies the total transmit SNR. 


B. Summary of JSDM 


We briefly review the JSDM strategy [16] that was originally introduced to reduce the cost for 
downlink training and CSI feedback in FDD large-scale MIMO systems by exploiting the fact 
that some users have similar transmit correlation matrices. The idea is to group together users 
with similar transmit correlations and then separate the different groups by a pre-beamforming 
matrix which is calculated only as a function of the channel second-order statistics, and does 
not depend on the instantaneous CSI. This creates a sort of spatial division, that exports the fact 
that the “long-term” preferential direction of the channel vectors of users belonging to different 
groups are nearly mutually orthogonal. In general, we have multiple sets of quasi-orthogonal 
groups, which we call classes. Each class t is served separately over an orthogonal transmission 
resource (i.e., a time-frequency slot) and may have a different number of groups, denoted by G t . 
Therefore, we partition the entire user set, /C = {1,2, •• • , K}, into T non-overlapping subsets 
(classes). 

As anticipated before, the JSDM precoder V = BP is formed by two-stages B and P. 
The first stage, referred to as pre-beamforming, consists of a matrix B of dimensions M x b, 
where b < M is an intermediate dimension whose optimization is discussed in [17]. The pre¬ 
beamforming matrix depends only on the channel second-order statistics, i.e., on the channel 
covariance matrices of the different groups served simultaneously by spatial multiplexing (i.e., 
belonging to the same class). Since B depends only on the second-order statistics, which is 
very slowly timc-varyingj^] it can be considered as perfectly known by the base station. The 
second stage consists of a MU-MIMO precoding matrix P , of dimension b x s, determined as 
a function of the instantaneous realization of the projected effective channel H h B. We divide b 


’Strictly speaking, for the classical Wide-Sense Stationary (WSS) fading channel model, the second-order statistics is time- 
invariant and therefore it can be estimated by time-averaging at negligible overhead cost. As a matter of fact, due to non-stationary 
effects such as large user motion, and users joining and leaving the system, the WSS assumption holds only “locally” on a 
time scale which is anyway orders of magnitude slower than the channel coherence time T c . Therefore, while the estimation 
and tracking of the channel covariance matrix is an interesting topic in itself, it is safe to assume here that this is known at no 
significant overhead dimensional cost. 



and s such that b = b g and s = s g> where t>g>Sg for all g — 1,..., G, and denote by 
B g the M x b g pre-beamforming matrix of group g. Thanks to the above user partitioning, we 
can consider estimating only the G diagonal blocks 



(3) 


where H g is the aggregate channel matrix given by stacking the channel vectors of users in 
group g, and each group is independently processed by treating signals of other groups as 
interference. In this case, the MU-MIMO precoding stage takes on the block-diagonal form 
P = diag(Fi >''' ,Pg), where P g e C 6sXSs , yielding the vector broadcast plus interference 
Gaussian channel 



(4) 


h^g 


for g = 

C. Transmit Correlation Diversity 

We introduce the notion of transmit correlation diversity to better understand the key idea 
of JSDM. Fig. |T| depicts a simple example which explains the virtual sectorization enabled by 
exploiting diverse transmit correlations in a three-sector BS with D = 1/2 in the following steps. 

1) In the beginning, angular regions (pie slices drawn by AoD[] and AS at the BS) roughly 
representing long-term eigenspaces are overlapped, i.e., user groups are interfering with 
each other and there is no noticeable structure. 

2) Put together the red angular regions into class t — 1 and separate them by multiple pre¬ 
beamforming along their respective eigenspaces. By doing so, each group can be viewed 
as a virtual sector. 

3) Do the same thing on the blue regions for class t — 2. 

4) Multiple users within each group (i.e., virtual sector) can be simultaneously served by the 
second-stage MU-MIMO precoding. 

Given the geometric intuition provided by the clustered scattering correlation model (e.g., 
one-ring model [3j), we define transmit correlation diversity of a multiuser system as follows: 

4 AoD and angle of arrival (AoA) are generally different in FDD. As AoD is more precise at the transmitter side, we prefer 


the terminology AoD to AoA. 
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AoD 



Fig. 1. Illustration of virtual sectorization exploiting transmit correlation diversity with T = 2 and G = 4. 


Definition 1 (Transmit Correlation Diversity). A multiuser MIMO downlink system after user 
partitioning is said to have G degrees of transmit correlation diversity, if the eigenspaces of G t 
groups in class t are mutually linearly independent for all classes and G = j. G\. 


Notice that the number of groups per class, G t , must be an integer by definition. Although 
transmit correlation diversity is formally defined conditioned on the linear independence between 
the group eigenspaces, the effect of transmit correlation diversity does not necessarily require the 


condition, as will be shown later in Sec. III-D If the group eigenspaces are linearly independent 
each other, we can exactly separate (orthogonalize) the group eigenspaces by using the block 
diagonalization pre-beamforming. However, we assume mutual orthogonality between group 
eigenspaces, which is a stronger condition than linear independence, for the sake of ease of 
analysis in this work. We also let T = 1 unless otherwise is mentioned, since each class consumes 
orthogonal resources. It is further assumed that G groups are formed in a symmetric manner 
such that each group has the same number K' = K/G of users and the same rank r = M/G 
of R g = UyAyU y for simplicity, where G divides both K and M. It is not difficult to extend 
to the general case of multiple classes and asymmetric per-group parameters with orthogonality 
replaced by linear independence. In the sequel, we present an ideal structure of the transmit 
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correlations taken from [16]: 


Definition 2 (Unitary Structure). We say that a set of user groups has unitary structure of their 
transmit correlations if all users in group g have a common correlation eigenspace spanned by 
the tall unitary matrix U 9 , and if the M x rG matrix Lf = [U i, • • • , If a] is unitary such that 

u"u = UU H = I. 


For the general case of rG < M, this structure is called tall unitary such that f/ H U = /. 
Under the unitary structure, we just let b g = r and B = U_. These choices eliminate interference 
between G groups and the resulting MIMO BC is given by 

y g = H " g P,d, + z 9 = W»AfP s d, + z, (5) 


where W g is an r x K' matrix with i.i.d. elements ~ CAf(0, 1), for all g, thereby yielding the 
reduced column dimensionality of the effective channel H y . Using (j5j), we arrive at the following 
simple yet important result [ |T6[ Thm. 1]: Under the unitary structure, the ergodic sum capacity 
of the original MIMO BC Q with perfect CSI is equal to that of parallel subchannels ([5]) with 
reduced dimensional H (/ , given by 

G r I 


E e 


max 

.£?=itr (S g )<p 


log 


/ + A l J 2 W a S u W"\ l J 2 


( 6 ) 


where S g denotes the diagonal K' x K' input covariance matrix for group g in the dual multiple- 
access channel (MAC). This can be intuitively verified by noticing that the effective channel H (y 
with reduced dimension of K' x r is unitarily equivalent to the original channel H g of K' x M 
under the unitary condition so that we can get effective channel dimension reduction without 
loss of optimality. This dimension reduction effect provides significant savings in CSI uplink 
training (for TDD systems) and both in downlink training and CSI feedback (for FDD systems) 
by a factor of G. 


III. Impact of Transmit Correlation to the Power Gain of MIMO BC 


It was shown in [18] that, in the large number of users regime, transmit correlation may 


significantly help the capacity of Gaussian MIMO BCs. One may think that if we can fully 
exploit the ideal unitary structure, we might do better than the independent fading case also 
in other regimes of interest. Assuming perfect CSI throughout this section, we will show that 
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this holds true in some cases but not in all cases of interest. This is due to the fact that there 
is a tradeoff between power loss resulting from the effective channel dimension reduction and 
beamforming gain from pre-beamforming across the long-term eigenspaces of the user groups. 
In order to understand the tradeoff, we carefully investigate the impact of transmit correlation 
on the power gain of the Gaussian MIMO BC for different regimes in terms of r = M/G, 
K' = K/G, and G. 

In the sequel, we first consider the asymptotic capacity bounds of correlated fading MIMO- 
BCs in the high-SNR regime and then characterize the high-SNR capacity in the large M regime 
in a compact form. We also compare these results with the corresponding independent fading 
case in order to see if there exist potential benefits of transmit correlation to the power gain of 
the channels. 


A. High-SNR Analysis 

For M fixed, we investigate the ergodic capacity bounds of MIMO BC at high SNR to capture 
the power offset between the i.i.d. Rayleigh fading channel and the correlated Rayleigh fading 
channel satisfying the unitary structure. Since a closed-form characterization of the ergodic sum 
capacity of MIMO BCs is very little known even in the perfect CSI (i.e., perfect CSI at the 
transmitter (CSIT) as well as at the receivers) case |28J, we rely on some upper and lower 
bounds. Using the result in ([6]), the well-known high-SNR equivalence between MIMO point- 
to-point channel and MIMO BC [|29|, [30] (referred to as asymptotic point-to-point equivalence 


in this paper), and random matrix theory in [ |3Tj Thm. 2.11], we get the following bounds on 
the high-SNR capacity behaviors of correlated fading MIMO BCs. 


Theorem 1. Suppose perfect CSIT on H g in (|3]) and the unitary structure of the users channel 
covariances as in Definition [2] For r < K', the high-SNR capacity of the corresponding MIMO 
BC with correlated Rayleigh fading satisfies 

P G 

M log + °(1) < C sum (P) — (ilTlog — + ^log |A S | + n(K',r)j < o(l) (7) 

9= 1 

where n(x, y ) = yG(— 7 +X ^=2 7 + UU ELx-j+i f) loge w/f/z 7 the Euler-Mascheroni constant, 
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o(l) goes to zero as P —» oo. For r > K', we have 

G K ' \ P 

EE log + o(l) < C sum (P) - [K log — + «(r, A")) < o(l) (8) 


9=1 i =1 


where X, hl is the ith diagonal element of A g . 

Proof: See Appendix [A] □ 

The above result can be generalized to the tall unitary structure for which M > rG and M 
in ([7]) is replaced by rG. The lower bound in ([T]) for the r < K' case may become rather loose 
when r <C AT'. This is because the inherent multiuser diversity in the case of a very large number 
of users cannot be captured by the asymptotic point-to-point equivalence used to prove the lower 
bound (see (45) in Appendix [A]) . Nevertheless, we will use these bounds in the sequel since they 
remain fairly tight as long as K'/r is small. The upper bound in ([7]) becomes asymptotically 
tight when the receivers inside each grouj£] are allowed to cooperate, which we call intra-group 
cooperation in this work. In the case of r = K' (i.e., M = K), 0 coincides with ([8]) and it 
is asymptotically tight. In addition, if we relax the unitary structure as the tall unitary structure 
where M > rG, |7]) becomes tight at high SNR for r = K' but M > K as well. 

Remark 1. An alternative expression of the asymptotic capacity behavior for r > K' can be 
found by utilizing the approach in & H0 Comparing with the alternative characterization 
and other previous results /5j/, /6]/ for the point-to-point MIMO case, we can see that (|S]) in 
Theorem [ 7 ] is more intuitive and insightful. For example, ([§]) will be used in Sec. IIFB to show 
that, for r > K', in general we cannot do better than the independent fading case. Moreover, 
our result reveals that the impact of transmit correlation diversity to the high-SNR capacity in 
fact depends only on the non-zero eigenvalues of R,,. This is also the case in the point-to-point 
case where G — 1. 


It immediately follows from [7] and [29j that, for M > K, the high-SNR capacity of the i.i.d. 


5 Since users in a particular group are often closely located, the intra-group cooperation within such a group is more feasible 
than the full cooperation across all users over the entire BS coverage. 

6 Although these point-to-point results assume that only the distribution of a channel is accessible at the transmitter, the 
difference from the perfect CSIT case that we are assuming vanishes at high SNR when the number of receive antennas is 
greater than or equal to the total number of transmit antennas (this is the case of the dual MAC in |6l when r > K 1 ). 
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Rayleigh fading MIMO BC with perfect CSI behaves like 

Cjr(P) = K log f + k(M, A') + o(l). 


( 9 ) 


In what follows, we focus on the r > K' case to investigate the impact of transmit correlation 
diversity compared to the independent case. To do so, we will make use of the affine approxima¬ 


tion introduced by Shamai and Verdii [331. The high-SNR capacity C(P) is well approximated 
by the zero-order term in the expansion of the capacity as an affine function of SNR (P) 


C(P) =5 00 (logP-£ 00 ) + o(l) (10) 

where S°° = limp-^ is the multiplexing gain and £°° = limp.**, (logP — is the 

power offset. Using the quantity £°°, for r > K ', we consider the difference between £^d an d 
£^, where £-g is the power offset of <(9|) and £^ is the power offset of the upper bound in ([8]) 
denoted by Q™(P), as shown by 


r*oo r*oo 

^nd ^ub 


lim 

P —^OO 



Q7(P)\ 
s°° ) 


3 

K 


G K' 




g =i *=i 


eigen-beamforming gain 


channel dimension loss 


(ii) 


where 3 ~ 10 log 10 2 is due to the fact that the power offset £°° is in units of dB (i.e., horizontal 
offset in capacity versus SNR curves). While the first term in ( |TT| ) can be a positive constant 
with P depending on the degrees of transmit correlation diversity and the condition number of 
A g , the second term is only non-positive. Since the former (along with A Y2^=i l°g |A S | in ^ 
for r < K') indicates the power gain due to pre-beamforming along group eigenspaces inherent 
in the unitary structure, we call this eigen-becimforming gain. The latter will be referred to as 
channel dimension loss as the channel dimension reduction in ([5]) incurs such a loss in power 
offset. As a result, transmit correlation diversity turns out to yield power loss as well as power 
gain. We also observe a tradeoff between the eigen-beamforming gain and the channel dimension 
loss as G is inversely proportional to r for M fixed. 

Let us first consider the case of r = K' (i.e., M = rG = K), in which the high-SNR behavior 
in ([8]) reduce to 

£ sum (p) = Mlog-^ + M( -7 + J^j log e + log | A s | + o(l). 

\ 1=2 1 J 9=1 


(12) 
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In order to investigate the tradeoff and easily evaluate the difference in ( |TT[ ). we can upper-bound 
the asymptotic capacity behavior in ([ 12 ]) by letting 


V = y = G 


(13) 


for all (g,i) (For details on this upper bound, see (47) in Appendix [a] and (59) in Appendix [c]). 
Then, the eigen-beamforming gain in ( [IT] ) is upper-bounded by 3 log G. Using the approximation 
of the Harmonic number 


1 , (( m i n + 

=7 + ln n +2^- 


£=l 


m =2 


m 


, 1 1 

= 7 + In n 4 -- 

' 2 n 12 n 2 


120 ?r 4 


0(n~ b ) 


where £(•) is the Hurwitz zeta function, we have 

111 
2 r 


r*oo 

'Mid 


^ub — 3 ( ^ 


= 3 


G 


12r 2 2 M 
1 G 2 — 1 


+ 


2 M 


12 M 2 


12 M\ 
loge + 0 (M -4 


loge + 0(M 


— 4 \ 


< — log e dB. 
2 r 


(14) 


This shows that, assuming the optimistic condition ( JT3| ) on the condition number of A g , the power 
gain can be positive but marginal. By investigating the r > K' case in a similar way with some 
manipulations, we can see that - w _ ELr-jt'+i f) loge + 0 (M“ 4 ), 


implying that even the marginal gain diminishes for r > K'. Accordingly, transmit correlation can 
be detrimental to the capacity depending on the condition number of A g . The eigen-beamforming 
gain is shown to be almost completely offset by the power loss due to the channel dimension 
reduction in this case. As a consequence, transmit correlation diversity in general provides no 
capacity gain for the unitary structure with r > K'. 


B. Large K Analysis 

We first consider the case where r is not significantly larger than K'. If the intra-group 
cooperation is allowed, the high-SNR capacity of correlated fading BCs can approach the 
corresponding i.i.d Rayleigh fading point-to-point case, depending again on the condition number 
of Ay. In contrast, the independent fading BC case needs the full cooperation to achieve the 
same high-SNR capacity. But, this seems infeasible and the corresponding channel is not a BC 
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any more. As a result, transmit correlation diversity is beneficial at least in this sense for r < K' 
but not r <C K'. 

The sum-rate scaling in the large K regime where r <C K', relevant in practice for hot-spot 


scenarios, was already addressed in [18], but without sufficient exposition. It is well known from 
Sharif and Hassibi pT| that the sum capacity of the i.i.d. Rayleigh fading MIMO Gaussian BC 
scales like 

C°7(K) = M log ^ + M log log K + o(l). 


In the special case where all users have both the same SNR and the common transmit correlation 


matrix R of full rank, the authors in 10] proved that the sum capacity scales like M log j- f + 
M log log K + log \R\ + o(l) where log \R\ < 0 due to tr (R) = M. 

Notice that the case where all users have the same correlation corresponds to the case of 
G — 1, i.e., the poorest case of transmit correlation diversity. As a matter of fact, for G > 1 
under the unitary structure where different groups of users have orthogonal eigenspaces, it was 


shown in [[18 ] that, for fixed M and large K, the asymptotic sum capacity of correlated Rayleigh 
fading MIMO BC is 


C sum (K) = Mlog ^ + M log log K + log I A, | + o(l) (15) 

9= 1 

where the detailed achievability proof is given in Appendix [B] This shows that, for the large K 
regime with correlated fading, there exists an additional term thanks to the eigen-beamforming 
gain as well as the well-known multiuser diversity gain term Mlog log A'. As an upper bound 
on the potential gain of transmit correlation diversity in the r <C K' regime, if the AS of group 
g, A g , is close to zero but Rayleigh fading is still valid, then 


lim sup Cf A {K) - £°°(AT) = 31ogM 

A g — 

where C^ A (K) and C°°(K) are the power offsets of Q™(A') and C sum (K), respectively. 

In the case of r > K ', the eigen-beamforming gain could be completely compensated by 
the channel dimension loss, yielding that transmit correlation does not help the capacity. In the 
r -C K' case, however, the channel dimension los^ vanishes in the large K regime as shown 


7 In this case, the channel dimension loss can be interpreted as multiuser diversity reduction with respect to the independent 
fading case due to the fact that user selection is independently performed in a group basis for only K' users rather than K 


users. 
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in Appendix [Bj while eigen-beamforming can still provide the power gain of up to 3 log G dB, 
which can be translated into the rate offset of M log G bps/Hz. This explains why the correlated 
fading case can significantly outperform the independent fading case in this regime. 

Finally, we consider the intra-group cooperation for large AT 7 but not necessarily r -C K' and 


compare its performance with (15) 


Corollary 1. Assuming the intra-group cooperation between the receivers within each group, 
we have 


C sum (P, K) = M log ^ + M log K' + ^log|A g | + o(l). 


(16) 


9=1 


This can be verified with the high-SNR upper bound in (43) in Appendix |A| and the fact that 


E[log | | ] ~ r log K' for large AT 7 , which follows from (56). The intra-group cooperation 

is shown to provide the additional power gain of 3 (log A' 7 — log log A') dB at high SNR for 
large K', compared to ( fl5j ) without cooperation. 

C. Large System Analysis 

We turn our attention to the large number of antennas regime, i.e., the large system analysis. 
For this analysis, we need the asymptotic behavior of large-dimensional Wishart matrices. To 
this end, a common approach utilizes known results from asymptotic random matrix theory © 


(e.g., see [35] based on the Marcenko-Pastur law [36]). In this paper, we shall instead consider 
direct analysis of the asymptotics of the capacity bounds in Theorem 1. 

Let 

M r 

and G be fixed such that both r and K' are taken to infinity along with M. 

Theorem 2. Suppose the perfect CSIT on H g , the unitary structure, and the uniform boundedness 
of X 9t i such that 

0 < e < < 1 ( 17 ) 

s^max 

for cdl g and any i £ Z + . As M ^ oo, for p < 1, the high-SNR capacity of the corresponding 

correlated fading MIMO BCs scales linearly in M with the ratio 
n \ r s 


log + o(l) < lim 
G M—too 


'ffNl _ io g L + (Lin) i 0 g _!_ 

M eg V // / 1 — g 


< o(l) 


(18) 
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where o(l) is a constant with M but vanishes as P —* oo. 

For g > 1, the high-SNR capacity scales linearly in K with the ratio 

log + 0(1) < 1/m _ log ftL + („ _ 1 ) log JL- < o(l). (19) 

G M—>cxd A e p — 1 

Proof: See Appendix [Cj □ 

When M = K with R y = I r for all g, we can see that {p~9]) reduces to 


C s 


'(P, M) , P 

M— = log 7 + o(1) 


( 20 ) 


which equals the well-known capacity scaling of the i.i.d. Rayleigh fading MIMO channel JT]. It 
immediately follows from Theorem [2] that the asymptotic capacity scaling of the i.i.d. Rayleigh 
fading MIMO BC is upper-bounded by 


Q7(P,M,r) P 
M ° g eg 


g 


g 


log 


1 — g 
g 


Q7(P,M,r) , gP , 

nd ’ = log - + (g- 1 log- t 

A e g — 1 


+ o(l), if g < 1 
+ o(l), if g > 1. 


( 21 ) 

( 22 ) 


These are also the upper bounds of the point-to-point case. In particular, 0 for g < 1 is the 
same as [7. Proposition 2]. We can see that the growth rates of the capacity of correlated fading 
channels under the unitary structure are upper-bounded by the i.i.d. Rayleigh fading case in ( |2Tj ) 
and (22) irrespectively of g. Although the additional power gain of up to 3 log G dB in ( [T?] ) in 
the large K regime may seem to contradict the observation from Theorem [2j the assumption 
therein is different in that only K increases to infinity with M fixed, while both M and K here 
increase with a fixed ratio g < 1. 

The assumption of the uniform boundedness of non-zero eigenvalues A fl- , of R g may seem 
unrealistic since R is generally of full algebraic rank even if eigenvalues except dominant ones 
decay quickly. However, it is quite reasonable at least in the large number of antennas regime with 
the antenna configuration of ULA. For this case, it was shown in [17] that non-zero eigenvalues 
of R can be accurately approximated by a set of samples {S([m/M]) : m — 0, • • • , M — 1} 
(with [x] being x modulo the interval [—1/2,1/2]) which has support of length p < 1 on such an 
interval. Here Sf) is the eigenvalue spectrum (discrete-time Fourier transform) of R. This implies 
that non-dominant eigenvalues go to zero when M is sufficiently large. In realistic channels, r 


should be considered as an effective rank denoting the number of dominant eigenvalues [16]. 













18 


For the sake of concreteness, in this paper, we will consider the one-ring model for R , which 
corresponds to the typical cellular downlink case where the BS is elevated and free of local 
scatterers, and the user terminals are placed at ground level and are surrounded by local scatterers. 
In the one-ring model, a user located at azimuth angle 6 and distance s is surrounded by a ring 
of scatterers of radius r such that angular spread (AS) A « arctan(r/s). Assuming the ULA 
with a uniform distribution of the received power from planar waves impinging on the BS array, 
the correlation coefficient between BS antennas 1 < p, q < M is given by 


[R] Piq = —- I e WD(p- q ) S in{u+0) duj 


(23) 


'-A 


where D is the normalized distance between antenna elements by the wavelength. 

In order to better understand the asymptotic behaviors of jj l°g |A S | and hence to 


obtain tighter bounds on the capacity scaling in (18) and (19), we may utilize the following 
approximation. Assuming the ULA antenna, the transmit correlation matrix R g of group g in 


(23) can be given in a Hermitian Toeplitz form. The eigenvalue spectrum S(£) of R g is defined 
by the discrete-time Fourier transform of the coefficients /y = [R g \^t_ k , i.e., 

OO 

S(i) = J2 rte ^' kS ’ [- 1 / 2 , 1 / 2 ]. 

fc =0 

For most cases of interest, this is a uniformly bounded absolutely integrable function over £. 
Then, the limiting behavior of - log | R g can be explicitly expressed by using the well-known 


Szego theorem 1371, [38 ] as follows: 


lim — log | R g | = 


- 1/2 


r—>oc T 


1/2 


log S(£)d£. 


(24) 


For example, for the one-ring scattering model considered in [17] that the eigenvalue spectrum 
5(£) can be characterized in terms of only the geometric channel parameters as 


SK) = 2A 


E 


/cE[D sin(—A+0)+^, D sin(A+0)+£] 




(25) 


In general, we can accurately predict 2 log A g thanks to (24) from the scattering geometry that 
characterizes the propagation between a user group and the base station antenna array, avoiding 
the need for the eigendecompsition of the large-dimensional matrix R y . 
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Fig. 2. Sum capacity vs. SNR curves for M = 8,16 with all satisfying G degrees of transmit correlation diversity, “analy” 
and “simul” represent the high-SNR sum-rate scaling in Theorem |T| and Monte Carlo simulation, respectively. For M = K = 8 
and r = 2, (Ai, A 2 ) is given in the legend. For M = 16, K = 8 and r = 2, the blue curves indicate (Ai, A 2 ) = (8, 8), while 
the red curves (Ai, A 2 ) = (12,4). Finally for M = K = 16 and r = 4, the blue curves indicate (Ai, A 2 , A 3 , A 4 ) = (4,4,4,4), 
while the red curves (Ai, A 2 , A 3 , A 4 ) = (7, 5,3,1). 


D. Numerical Results and Summary 

This subsection provides some numerical results to show the validity of the impact of transmit 
correlation diversity to the capacity in the previous subsections. In order to generate a unitary 
structure, a set of the group eigenvector matrices {U g }^ =1 were obtained by a randomly chosen 
channel covariance matrix, since the arbitrary realizations of the eigenvector matrices do not 
change the capacity as long as U^U g > = I for all g' ^ g. For this structure, we assume A g = A 
for all g for convenience, where A = diag(Ai, • • • , A r ). 

Fig. [2] depicts the ergodic sum capacity versus SNR curves of the i.i.d. and the correlated 
Rayleigh fading MIMO BCs for different M, K, and A with G = 4. In the unitary structure with 
M = K (i.e., r = K'), whether transmit correlation diversity can be beneficial to the capacity 
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Fig. 3. Sum capacity vs. SNR curves for M = 8 with different numbers (K) of users. While G = 4 for the unitary structure, 
9i € [—60°,60°] and A t £ [5°, 10°] for the non-unitary case. 


or not depends on the condition number of A, as mentioned earlier. Moreover, the high-SNR 
sum-rate bounds in Theorem |T| are shown to be tight when M = K. For M — 16, K — 8 and 
r = 2, we can see that our asymptotic bound is also tight for the tall unitary structure with 
M > K and r = K' and that the tall unitary structure suffers from performance degradation 
relative to the unitary structure. 

Fig. [3] also compares the capacities of the independent and correlated fading cases for different 
K with M — 8. For the unitary structure, we set (Ai,A 2 ) = (4,4), i.e., the optimistic case in 
0- for all K . The correlated fading case has a larger capacity than the independent fading 
case for K = 32 (M < K), while transmit correlation diversity does not help the capacity for 
K = 4 (M > K). In addition, we would like to see if the foregoing results on the beneficial 
impact of transmit correlation derived by imposing the unitary structure are still valid for realistic 
channels not assuming the structure. To generate this “non-unitary” (unstructured) case, we utilize 


the one-ring channel model in (23) with the ULA of D = 1/2 (half wavelength) where 9 t is 
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uniformly distributed over the range of [—60°, 60°] and A, is uniformly distributed over the range 
of [5°, 10°], where 0, and A* are AoD and AS of user i, respectively. Then, we can see that, 
for K = 32, transmit correlation is still noticeably beneficial to the capacity even if we did not 
assume the unitary structure. This remarkable result is also observed in the following evaluation. 

Fig. [4] depicts the sum capacity versus the number of users curves for different M. For the 
unitary structure, we set G = 4 for M — 8 and G — 2 for M — 4 with (Ai,A 2 ) = (4,4) for 
all M. The rate gap between correlated and independent fading cases gets larger as K and G 
increase. The exact capacity is shown to be predictable by the analysis in ( |~i3| ) for M — 4, 
while a much larger K would be needed for the same asymptotic capacity in ( [15] ) to converge 
for M — 8. For the non-unitary case, we set A, : e [5°, 20°]. When K = 10,000, the rate gap 
between the independent and non-unitary cases is 4.3 bps/Hz for M — 4 where the potential 
gain of M log G is 4 bps/Hz, while it is 10.2 bps/Hz for M = 8 where M log G = 16. Therefore, 
a surprisingly large portion of the potential rate gain (i.e., power gain) of transmit correlation 
diversity is shown to be achievable for sufficiently large K with the realistic setup where no 
structure of users transmit correlations was assumed. 

Assuming the unitary structure, the full-CSl capacity results in this section can be summarized 
as follows. 

. For r > K' (i.e., M = rG > K ), we can obtain the power gain of at most ^loge 
dB (i.e., f log e bps/Hz rate gain) at high SNR, given the optimistic condition where the 
eigenvalues of R g are approximated by Depending on the condition number of R g , 
transmit correlation diversity is hence even detrimental to the capacity of MIMO BCs in 
this regime. 

• For r < K' but not r <C K', our numerical results indicate that transmit correlation diversity 
yields a considerable capacity gain even when K is not so larger than M. In addition, the 
intra-group cooperation is sufficient to achieve the full-CSI capacity of the point-to-point 
case. 

. For r -C K', transmit correlation diversity can increase power gain by up to 3 log G dB 
over the i.i.d Rayleigh fading BC at any SNR. 

• Although transmit correlation diversity is well defined for the unitary structure, its effect 
is not restricted to the structured case. It is observed from numerical results that a “semi¬ 
unitary” structure is implicitly created by multiuser diversity (i.e., user selection/scheduling) 
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Fig. 4. Sum capacity vs. the number of users for M = 4, 8 at SNR = 10 dB. 


for r < K', yielding that even unstructured transmit correlations of users do help the 
capacity. 

It turns out that transmit correlation diversity may be rather detrimental to the power offset 
of MIMO BCs for M > K except for some optimistic conditions, in contrast to the M < K 
case. This is mainly due to the fact that the former case suffers from the power loss due to 
effective channel dimension reduction, while such a loss vanishes owing to multiuser diversity 
in the latter as K gets much larger than M. 

So far, we have assumed prefect CSIT with no cost, for which in general we cannot do 
better with transmit correlation diversity for M > K. Notice that the typical scenario of large- 
scale MIMO belongs to this unfavorable case which includes M K. Consequently, transmit 
correlation diversity could not improve the performance of large-scale MIMO systems. It will 
be shown in the following section that this argument is not true for realistic pilot-aided systems, 
where CSIT is provided at the cost of downlink training. 
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IV. Fundamental Limits of Pilot-Aided MU-MIMO Systems 

In this section, we investigate asymptotic capacity bounds of pilot-aided MU-MIMO systems, 
in which the resources for downlink training are taken into consideration. While FDD systems 
make use of downlink common pilot and CSI feedback, TDD systems employ uplink dedicated 
pilot to exploit the uplink-downlink channel reciprocity for downlink training. Both FDD and 
TDD need dedicated pilots for users to estimate their downlink channels for coherent detection 
J39) . Dedicated pilots go through the downlink beamforming vectors and hence they can be sent 
all together on the same time-frequency resource without significant interference, provided that 
M is sufficiently large, so that they may consume a negligible resource. Therefore, we focus 
on the cost of the common downlink pilots in FDD, where the CSI feedback issue was already 
addressed in CD. or the uplink per-user pilots in TDD. 

In the independent fading channel, the downlink FDD system uses M* id downlink dimensions to 
allow users to estimate the M* id -dimensional channel vectors, where M* id = min{M, K, \ TJ 2J }. 
Assuming that CSIT is acquired by the base station through a delay-free and error-free feedback 
channel]^] the high-SNR capacity of MU-MIMO downlink systems is upper-bounded by 

M ; t d (l - M*JT c ) log SNR + 0(1). (26) 


If we optimize the number of (active) base station antennas and users as a function of the channel 
coherence time T c , it turns out that 


Ml 


iid 


1 - 


Ml 


iid 


T r 


T c 

< —, 

“ 4 ’ 


where the upper bound is attained by activating only Tj 2 antennas to serve TJ 2 users. Therefore, 
the system multiplexing gain does not scale with min(M, K) and the per-user throughput 
vanishes as 0(jr), when K becomes large and T c is fixed. This upper bound is also valid 
for TDD systems with reciprocity. For example, the number of scheduled users (specifically, 
s = M* id for the independent fading case) among the entire K users is limited by uplink pilot 
overhead (also affected by T c ), which is optimized by letting s = T c /2 and devoting half of the 


coherence block to uplink training (see [27]). If instantaneous feedback within coherence time T c 


s This is clearly an over-optimistic assumption, but it is in general good enough to obtain a simple bound on what it is actually 
possible to achieve through realistic feedback implementations. 
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is not possible, the impact of the resulting channel prediction error on the system multiplexing 


gain can be found in [39]. 

As already pointed out, the above factor T c / 2 significantly limits the system performance for 
both M and K large. However, noticing that this result holds true in the independent fading 
channel, we are to characterize some performance limits in correlated fading channels under the 
unitary structure in the following. 


A. Training Overhead Reduction 

1) FDD (Multiple pre-beamformed Pilot): The common pilot is in general isotropically 
transmitted, since it has to be seen by all users. We first consider a simple training scheme 
for FDD systems, where the downlink common pilot signal X dl for group g is given by the 
pre-beamforming matrix B g as follows: 



Ptr® g 


where p tr indicates the power gap between the training phase and the communication phase. 
Assuming the unitary structure, we can just let B g = U g and then the received pilot signal 
matrix for group g is given by 


Y d ' = H"X dl + Z d ' = p tI H"+Z 


(27) 


where X‘" = , X a g . This indicates that G pre-beamformed pilot signals x A g i , \/g, where x A g i 

is the ith column of X A g , can be multiplexed and transmitted through a single pilot symbol and 
hence the overall common pilot signal X dl consumes only r symbols, reduced by a factor of G. 

Based on the above noisy observation of the pilot signal, each user in group g can estimate the 
effective channel h = U^h, which is unitarily equivalent to /i in (JTJ under the unitary structure, 
as shown in Sec. II-C Therefore, the proposed common pilot does not incur any loss due to 
pre-beamforming, as if it were a conventional pilot signal isotropic to all users. A generalization 


of the above scheme was already given in [17], which chose X A j = B g U ai with U a[ being a 
scaled unitary matrix of size r x r, thereby making the downlink common pilot signal for each 
of antennas spread over r pilot symbols. However, the previous work did not consider only an 
optimization of the system degrees of freedom taking into account the cost for downlink training 
dimension but also TDD or uplink systems in the next subsection. 


rdl 
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2) TDD: The same idea above can be naturally applied to the TDD case with receive 
beamformer for the uplink dedicated pilot. To be specific, the received pilot signal matrix 
for TDD systems can be given by 

G 

Y al = Y J H g pJ K '+Z' iX . 

9= 1 

By receive beamforming, i.e., multiplying from the left by for group g, we have 


Y u g l = U^Y ul = p tr H g + Zf (28) 


where Z"] = U^Z ul . The uplink dedicated pilot signal for all K users consumes only K’ 
symbols, reduced by a factor of G again. As a result, we can obtain the pilot saving not 
only in FDD systems but also in TDD systems, where the unitary structure is uplink-downlink 
reciprocal. Notice that such a pilot saving is also valid for MIMO MAC, i.e., MU-MIMO uplink 


systems. In [211, a similar idea to the unitary structure was differently used to eliminate the pilot 
contamination effect in the multi-cell TDD network instead of reducing the overhead for uplink 
dedicated pilot in each single cell. 


B. Pilot-Aided System I 

Assuming the unitary structure in a symmetric fashion such that M = rG with the perfect 
knowledge on channel second-order statistics available at the transmitter, we can have up to G 
groups with r (long-term) eigenmodes each. However, using too many eigenmodes per group may 
degrade the performance of pilot-aided systems due to the cost of downlink training. Inspired 


by the pilot-aided system in [24], this section is devoted to maximize the system multiplexing 


gain for the pilot scheme in Sec. IV-A with T c finite. Since we have to use all the M antennas 


to preserve the unitary structure, we cannot directly follow the same line in [24]. Rather, since 
it does not make sense to use less than G degrees of transmit correlation diversity available in 
the MU-MIMO system, we just need to investigate how many eigenmodes (instead of active 
antennas) per group should be used in the communication phase. Although we focus on the 
downlink system in this section, the same multiplexing gain can be achieved in the uplink 


system according to (28). 
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Suppose that we use q of r eigenmodes per group in the communication phase for FDD 
system^} where q < r. Then, the number of degrees of freedom for communication within each 
group is upper-bounded by 

minjg, A"} (T c - q) (29) 


where we devote only q channel uses for the training phase thanks to the pilot scheme in Section 


IV-A. We call this pilot-aided system I in this work. The optimal number of eigenmodes per 


group to maximize (29) is given by 


q* = min < r, K', 


(30) 

yielding the pre-log factor q*G(l — |r). This indicates that there is no need for using more than 
q* eigenmodes per group. Therefore, we obtain the following results: 

. Assuming the unitary structure with G degrees of transmit correlation diversity, the high- 
SNR capacity of pilot-aided MU-MIMO systems is upper-bounded by 


M* 1 — 


M* 

TnG 


log SNR + 0(1) 


where M* = q*G = min {M, K, |_ppj }• 

• Then, we have the fundamental limit on the system multiplexing gain 


lim M* 

min{M,A'}—»oo 


1 - 


M* 


T C G 


(31) 


for T C G e 2Z + . 

It turns out that, for both M and K large, exploiting G degrees of transmit correlation diversity 
can increase the system multiplexing gain by a factor of G, compared to the independent fading 
case. It is evident that as long as the degrees of transmit correlation diversity is sufficiently 
large such that G > 2 min {M, K}/T c (i.e., M* = min {M,K}), the optimal number M* of 
eigenmodes is not affected any longer by the coherence time interval T c . As a consequence, 
the system multiplexing gain is not saturated but rather it can keep growing as min{M, AT} 
increases. If M* = M (or AT), the high-SNR capacity of the pilot-aided system is (|7j) (or ([8])). 


9 In the TDD case, it suffices to suppose that we schedule q of K' users, where q < K ', and to optimize the degrees of 
freedom with respect to q taking into account the uplink pilot overhead. 
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Fig. 5. Impact of the degrees of transmit correlation diversity (G) on multiplexing gain for different numbers of min(A/, K) 
in pilot-aided systems, where the solid lines indicate T c = 32 and the dash-dotted lines indicate T c = 100. 


For M* = |_^ 7 pJ, the high-SNR capacity equals to ([8]) with K and K' replaced by [^f\ and 
L^J, respectively. 

The following example compares the upper bound ( [26] ) on the system multiplexing gain for 
the independent fading case and the new upper bound pT] ) for the correlated fading case, when 
T c is taken from real-life cellular systems. 


Example 1 . Let T c take either 32 long-term evolution (LTE) symbol duration [40] (approximately 
60 km/h) or 100 symbol duration (19 km/h). Also, suppose that the unitary condition is attained 
such that G = 4 and G = 8. Fig. [5] shows the Zheng-Tse upper bound, M* id ( 1 — M* jd /T c ), and 
the new bound, M*( 1 — M*/T C G), on the system multiplexing gain as min {M,K} increases. 
It can be seen that exploiting transmit correlation diversity can increase the multiplexing gcdn 
by a factor of 4 for G = 4 and 8 for G = 8, respectively. 


In the following, we consider two asymptotic cases. While the first case is when G becomes 
large for r fixed as M increases, the other is for G fixed with r large. 
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1) Large G Case: So far, we have assumed a fixed number of degrees of transmit correlation 
diversity, G. We now turn our attention to the case where as M —>• oo, G also grows such 
that the ratio G/M is not vanishing (i.e., bounded). In practical systems, we generally consider 
a large-scale array in the high carrier frequency f c due to the space limitation of large-scale 
array and the fact that the wavelength is inversely proportional to f c . Then it is fairly reasonable 
to increase M proportionally as f c grows and hence to let M depend on f c . It was observed, 


e.g., in mm-Wave channels [41], that the higher f c , the smaller number of strong multipaths 
that the receivers experience due to higher directionality. This sparsity of dominant multipath 


components is also verified by mm-Wave propagation measurement campaigns [42], Thus, the 
high transmit correlation diversity is attainable in the high f c case. Then, as both M and f c 
grow, G may continue increasing such that G/M is fixed. 

Since the impact of noisy CSIT on the system performance under the unitary structure was 
already addressed in [ |T7| , we rather focus on the impact of pilot saving on the high-SNR capacity 
in this work. Thus, the BS can acquire noiseless (error-free) CSIT at the cost of either downlink 


common pilot or uplink dedicated pilot, according to (27) and (28). Denote by C^ m (P, M*,v) 
the high-SNR capacity of pilot-aided system I for M* and v large, where v — G or r and 
the subscript pi indicates the pilot-aided system I. Assuming that the perfect CSIT is provided 
by an ideal (i.e., delay-free and error-free feedback) uplink with no channel estimation error in 
FDD and neither calibration error nor pilot contamination in TDD, respectively, C^ m (P, M*,v) 
is simply given by 

C—(P,M‘,v ) = (l - £) C~(P,M\v). 

In what follows, using the results of Section III, we refine the 0(1) term in pT) first in the 
large G regime and then in the large r regime. Let 

M* q* 


fi'pi 


K K' 


be fixed. In this scenario, G is taken to infinity along with M but both r and K' are finite, 
unlike Theorem [2| Therefore, we shall make use of pi] Thm. 2.11] instead of Lemma |T] then 
we can apply Theorem |T| in the sequel. 


Theorem 3. Suppose the unitary structure and the uniform boundedness of X gj in ( |/7[ ). As 
M —> oo, for /i < 1, the high-SNR capacity of the pilot-aided system I scales linearly in M* 
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with the ratio 


T r 


1 - — ) log/xe + o(l) < 


Cp U / n {P, M*, G) 


M* 


P 


1 T c 

K' 


x \ log — + log e | 


(i-, P , 

1=2 


Ppi 


K' 

E 

t=(l-p p i)K'+l 


< 0(1)- 
(32) 


For p > 1 , the high-SNR capacity scales linearly in M* with the ratio 


, q'\, , ,,, „ C 7 ,-(P,M\G) 

T c ) K J ~ M* 


P 


K' 


1 


1 ~T ) + M -7 + ^7 ) \ <o(l). 


1=2 


(33) 


Proof: For p < 1 and q* < Tj 2, we have M* = M,q* = r, p pi = p. In this case, we use 
0 instead of (18). Then, the growth rate at which the high-SNR capacity increases in the large 
G regime as M —» oo is lower-bounded by 


G 


C™JP,M,G) P ^ 

- > log— + loge^ 


M 


9=1 


r 

M 


K ' 1 TC K ' 1 

1 K — r v—- 1 
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+EE 
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1=2 


l=K'-r +1 


G 


M 


^ lo g| A 9 | +o(!) 


9=1 


l° g E + iog e ( -7 + X^7+ (1 ^ 


K 


1=2 


P 


7 )+ 1 °g^ e + °( 1 ) 

1=(\-H)K'+1 


(34) 


where we used (17) since We can similarly get the upper bound in (32) for q* < TJ 2. 

When q* = TJ 2, the rate of growth for the p < 1 case can be obtained in a similar way by 
noticing M* = JJ. Therefore, for these two cases in the large G regime with q* fixed, we get 


(32). 


For q* < TJ 2 and p > 1, noticing that M* = K, q* = K', p pl = 1 and using 0, we get 


C sum (P, K, G) 
K 


P 


G 


K' 


A" 


G K’ 


>log- + logeJ] —( -7 + ^7) +^5Z^logA 9ii + o(l) 


9=1 


K 


1=2 


I< 


9=1 *=1 


K 1 


= l° g R , + log e( -7 + 5^7 ) + loge+ o(l) 


t=2 


(35) 
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When q* = T c / 2 and p > 1, the growth rate can be obtained again by noticing M* = 


Then, we obtain ( |33| ) for p > 1. □ 

The following result shows the most optimistic gain of transmit correlation diversity in the 
limit of A g —> 0 for all g, which we provide as a capacity upper bound even though this channel 
assumption may seem unrealistic. 

Corollary 2. For p = 1 and T c > 2, as A g — >■ 0 and M —> oo with p fixed, the high-SNR 
capacity of the pilot-aided system I scales linearly in M with the ratio 


C% m (P,M,G) . P 

lim limsup —- = (l — T log —. 

oo A a —*0 M v ’ e 


(36) 


To prove this, we first notice that the condition of pT) can be restated as T c > 2 min(r, K') in 
this case. Hence the sufficient condition is guaranteed just for T c > 2, since r —> 1 as A g —> 0. 


Using this and (32), (36) immediately follows with r/T c = T ( 


-l 

C 


It is remarkable that if the unitary structure is attained with T c sufficiently large and A g 
sufficiently small, the high-SNR capacity of MU-MIMO systems approaches the full-CSI capacity 


in (20). The systems of interest are scalable in min (M,K) and also the user throughput does 


not vanish any longer unless M <C K (i.e., p <C 1), in sharp contrast to (j26j). It should be 
pointed out that the growth rate in Corollary [2] was not obtained just by pilot saving but also by 
the power gain due to eigen-beamforming. 

2) Large r Case: In the large r regime where r goes to infinity while G fixed, we can obtain 
the following result by using Theorem [2j For p pl < 1 


M* 


T r 


P +(Ljf£l|l 0 g 


ep p \ 


fipl 


1 Py\ 

(37) 

Note that the p v \ > 1 case does not happen in pilot-aided system I, since we make use of only 


K' eigenmodes regardless of how large r is, i.e., q* = K' according to (30). This restriction in 
system I may cause a nontrivial rate loss for M > K, as will be discussed in the next subsection. 

Fig. [6] shows the sum-rate upper bounds in ( [32] ), ( |33] ), and ( [37] ) on the asymptotic capacity 
for different system parameters in pilot-aided systems I. For large M, the Zheng-Tse bound 
is given by M* id ( 1 — M* id /T c ) log - + o(l). For large G and fixed q*, the system multiplexing 
gain grows linearly with min {M,K}, whereas this is not the case with large q* and fixed G. 
To understand the large rate gap between p = 1 and p = 2, recall the eigen-beamforming 


<o(l). 
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Fig. 6. Asymptotic sum-rate upper bound curves versus min(M, K ) in pilot-aided system I at P = 30 with T c = 50, where 
q* = 10 when G is large (v = G ), and G = 10 when q* is large (v = q*). 


gain of up to log /xG in Section III-B and that a dual MAC is equivalent at high SNR to the 
corresponding MIMO point-to-point channel with K transmit antennas and M receive antennas. 
The equivalent MIMO channel is well understood to have a logarithmic power gain scaling with 
M due to receive beamforming. For /x = 0.5 case, the large rate gap from /x = 1 is because the 
upper bound was given by allowing the intra-group cooperation within each group. Finally, for 
large q* but fixed G, the two cases of /x = 1 and /x = 2 collapse into the red solid line. This 
is due to the fact that pilot-aided system I considers only multiplexing gain but not power gain, 
which will be addressed in the following subsection. 


C. Pilot-Aided System II 

In the large M regime, the M > K (/x > 1) case may be more frequently encountered in 
realistic systems, which is also the typical scenario of large-scale MIMO. We introduce a new 
pilot-aided system to address the foregoing issue for this case with r large but G fixed. In 
contrast to pilot-aided system I, in which only K eigenmodes are used by letting M* = K when 
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Fig. 7. Values of f(Q) versus the number of eigenmodes to use, q, in pilot-aided system II when M > K (i.e., fi > 1), where 
M = 200, T c = 64, and the ‘o' indicates the optimum numbers of eigenmodes, M* 2 , and the ‘x’ indicates M*. 


K = min {M,K, }, we shall allow in the new system referred to as pilot-aided system 

II to use more than K eigenmodes, even though the degrees of freedom is certainly at most 
K. By doing so, we may obtain a noticeable power gain suggested by ( [T9] ) owing to transmit 
correlation diversity which compensates for the increase in channel uses required for downlink 
training. To understand this, notice that using more than K eigenmodes has a smaller impact on 
the system multiplexing gain as G and/or T c grows, as shown in ( f3~i~[ ). 

To take into account the additional power gain from using more than M* eigenmodes in 


pilot-aided system II, we replace the optimization problem in (29) with the following one based 


on the upper bound in (19). 


M *2 = argrnax f(q) (38) 

<? 

subject to q* < q < r, where f(q) = M*{( 1 — ^-jlog^^y + — ljlog^^-} with M* 

(the maximum number of degrees of freedom for the communication phase) unchanged and the 
subscript p2 indicates the pilot-aided system II. The high-SNR capacity of this new pilot-aided 
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Fig. 8. Asymptotic sum-rate upper bounds of two pilot-aided systems where ^ = 2, G = 10, and P = 30. 


system for /i > 1 scales linearly in if with the ratio 


ill 


P2 


T C G 


, A mi Cg*(P,M£,r) 

log -pp- + o(l) < — 


G 


- 1 - 


M 


P2 


T,G 


if 

log + ( /ip2 - 1) log 


lp2 


lp2 1 


+ Cp2,2 \ < o(l) (39) 


where /r p2 = 

Fig. [7] shows the optimum number of eigenmodes, M* 2 , for different P and /i with M = 200. 
Here, ill* = If = 40 for // = 5 and ill* = if = 100 for /i = 2. Therefore, if we consider the 
power gain due to eigen-beamforming as well as the system multiplexing gain, the optimum 
values of M* 2 are shown to be quite different from ill*. We can also see that the resulting rate 
gap is reduced as P increases for T c = 64. Fig. [8] compares the asymptotic sum-rate upper 
bounds of pilot-aided system I and II when T c = 32 and T c = 128. It is shown that the rate gap 
gets larger as T c increases, since, for large T c , the extra overhead due to training more than if 
eigenmodes reduces, as mentioned earlier. 
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Remark 2. So far, we have assumed T = 1 such that the channel covariances of all users 
associated to the BS satisfy a single unitary structure, which may seem far from practiced 
systems. Recall that if we extend to the case of multiple classes as shown in Fig. [7] each 
class should consume orthogonal time/frequency resources. Then the pilot design in Sec. \IV-A 
must increase the pilot overhead by a factor of T. Assuming homogeneous classes (the same 
number of homogeneous user groups per class), the pre-log factor in (p~7j) is then replaced with 


M* 


f M*T\ 


(40) 


where M* = min {M, K, |_^7j }• This may undermine the potential gain of transmit correlation 
diversity. Therefore, we have the system design guideline that T should be less than the number 
(G) of degrees of transmit correlation diversity and it must be restricted as small as possible. If 
T > G, there is no point in using the multiple pre-beamformed pilot in Sec. IV-A\ 


Finally, it should be pointed out that transmit correlation diversity can promise a significant 
capacity gain in all regimes of interest, considering the cost of downlink training. Even though 
the new diversity may bring a power loss to the capacity compared to the independent fading 


case assuming perfect CSI in Sec. Ill the increase in multiplexing gain can fully offset the 
power loss unless min {M,K} is too small. 


V. Concluding Remarks 

In this paper, we have investigated several asymptotic capacity bounds of correlated fading 
MIMO BCs to understand the impact of transmit correlation on the capacity. In order to intuitively 
show and fully exploit the potential benefits of a new type of diversity — transmit correlation 
diversity, we imposed the unitary structure on channel covariances of users. Assuming perfect 
CSIT without a cost of downlink training, we showed that transmit correlation diversity is 
not always beneficial to the high-SNR capacity of Gaussian MIMO BCs in all regimes of 
system parameters of interest such as M, K, r, G, and T c . In particular, the new diversity is even 
detrimental to the capacity in the large-scale array regime. Taking the cost for downlink training 
into account, however, we found that transmit correlation diversity is indeed very beneficial 
in all regimes of interest. Specifically, the system multiplexing gain can continue growing as 
the number of antennas and the number of users increase, as long as the degrees of transmit 
correlation diversity are sufficiently large. Even if we have focused on the downlink system in 
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this work, notice that the new diversity can be leveraged in various forms of MIMO wireless 
networks including multi-cell uplink/downlink systems and wireless interference networks. 

It was shown that the eigen-beamforming gain due to pre-beamforming along large-scale 
eigensapces of user groups is essential to achieve capacity benefits of transmit correlation 
diversity. This provides an insight that a precoding scheme which can realize a large portion 
of such a gain may be competitive to ZFBF with noisy or outdated CSIT for correlated fading 
channels particularly in the large K regime. In order to validate this argument, our recent work 


in [43] proposed a new limited feedback framework for large-scale MIMO systems. 

In MIMO wireless communications, there exist three most essential resources: time, frequency, 
and “small-scale” space that depends on instantaneous channel realizations. Apart from these 
resources, we have identified the new type of resource, transmit correlation diversity (namely, 
“large-scale” spatial resource), and provided some new insights on how to use it, when it is 
beneficial to the capacity, and how much it affects the system performance. The most remarkable 
result can be summarized as: Exploiting transmit correlation may increase the multiplexing gain 
of the MU-MIMO system, when taking into account the downlink or uplink training overhead, 
by a factor equal to the degrees of transmit correlation diversity, i.e., the number of user groups 
with mutually orthogonal (or linearly independent for a weaker condition) channel covariance 
eigenspaces. 


Appendix A 
Proof of Theorem □ 


We first prove the case of r < K'. Provided the unitary structure is available, the sum rate of 
the gth dual MAC subchannel in ([6]) can be rewritten as 


log 


I + A^WgSgW^A 1 / 2 


= log I A, 


WgSgW" I 


+ l°g I Ag 


(41) 


By allowing the intra-group cooperation (i.e., the receiver cooperation within each group) and 
following the standard argument, the capacity region of the dual MAC subchannel is outer- 
bounded by that of the corresponding cooperative MIMO system. Given the perfect CSIT and 
at high SNR, the asymptotic optimal input X g in the cooperative MIMO system is the uniform 
power allocation over r eigenmodes of W g W g with tr(X g ) < P, since the Wishart matrix 
W g W" is well conditioned with high probability for all g. Here, the difference with our problem 
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of interest is that the noise variances at r effective antennas of the receiver in the gth dual MAC 
in the RHS of ( |4Tj ) are scaled by \ 9)i , where i — 1, • • • , r. But this does not change the known 
result. Then, we have at high SNR (i.e., large P ) 


E 


max 
Ltr (s g )<p 


loglA-'+HW^ 


< E 

~ E 
~ E 


max log | A a 1 + W g X g Wg 


Ltr(x 9 )<p 
log 


P. 


\- 1 4- —W W H 

9 9 9 


log | W g W" 


+ rlo g T 


(42) 


where ~ denotes the asymptotic equivalence (the difference between both sides vanishes as 
P —> oo). As a consequence, when r < K', the sum capacity is upper-bounded as 
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C sum (p) < M log ^ E [ log \W g W" | j + log I A, I + o(l) 
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(43) 
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+ E i)loge + J]log|A g |+o(l) 
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(44) 


where we used the well-known result of random matrix theory in [ |3T| Thm. 2.11], namely, 


E 


In | W g W" | = ZZo K’ - £), where W y W" is almost surely nonsingular and ^{n) = 

—7 + 7 with 7 the Euler-Mascheroni constant. From (55), the second equality in (44) 

immediately follows. 


The lower bound in (|7j) is given by simply letting the diagonal input matrix S g as S g = j^I k 
for all g, which is in fact the optimal input covariance when only the channel distribution is 


accessible at the receiver in MIMO MAC with each user having the same power constraint [281. 
The resulting W g S (J W" = ^ W g W ^ is also a Wishart matrix with r degrees of freedom and 
hence 


P 


log (A; 1 + W g S g W"\ > log \W g W"\ + r log - + o(l) 


9” 9 
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(45) 


Using [31 Thm 2.11] again, we have 
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Then, the high-SNR capacity upper and lower bounds differ by Mlog -P. 
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Next, we consider the second case of r > K'. When the number of transmit antennas is 
greater than or equal to the total number of receive antennas in a MIMO BC, the sum capacity 


of its dual MAC is well known [29] to be equivalent at high SNR to that of the corresponding 
point-to-point MIMO system. This also implies that the uniform power allocation across K' 
eigenmodes, i.e., S g = is asymptotically optimal for the (jlh dual MAC (|6j) equivalent 

to the the point-to-point channel where the number of receive antennas (M) is larger than the 
number of transmit antennas (K). Since transmit correlation is only harmful to the capacity of 
the equivalent point-to-point channel for perfect CSI and large P, the capacity of each dual 
MAC is upper-bounded by the i.i.d. Rayleigh fading channel where X g .i — y- — G for all (g. i ) 
due to tr(A ff ) = M. Then, we have 
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log 


Ir+A^WgSgW^Al/ 2 


~ log 
< log 


Ir + -A l /2 W 9 W"Al/ 2 
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(47) 


For the lower bound, we can get 
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i=K >+1 


where (a) follows from the following lower bound on the determinant of the sum of two 


Hermitian matrices [44 ]: Let A and B be Hermitian matrices with eigenvalues A| (A) > \ 2 (A) > 
■ ■ ■ > A n (A) and Ai (B) > A 2 {B) > ■ • • > A n (B), respectively. If A n (A) + A n (B) > 0, then 

n n 

II ( X ^ A ) + < \A + B\ < J] (Aj(A) + A n -i+i(B)) . (49) 


2—1 


2—1 


In (6), we used from the fact that the non-zero eigenvalues of W g W^ are the same as those of 
W^W y . Similarly using the upper bound in (49), we can obtain another high-SNR upper bound 
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based on the same inequalities in [44] as 


K' 


log | a; 1 + W g S g W" | + log \Ag\ < log I W"W g \ + K' log - + log ]J X 9ti . 


i= 1 


However, the upper bound in ( [47] ) is clearly tighter than this one. 

Using (47), (48), and the fact that, for r > K', the Wishart matrix W^W g is almost surely 
nonsingular and hence invoking [31 Thm 2.11] again, we have 

G K' 
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Therefore, we have ([8]). This completes the proof. 


(50) 


Appendix B 
Achievability of ( p~5] ) 


The achievability proof of ( jT3| ) begins with (41), Corollary 1 in [18], and the uniform power 
allocation over groups such that S g = jjl r , yielding 

G r-, G 


r log log K' + M log + Y log I Agl + o(l). 


9 =1 
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(51) 


9=1 


Compared to M log log K in the i.i.d. Rayleigh fading case, the multiuser diversity gain reduces 
to Y, g =i r log log K'. To show that the loss due to this diversity gain reduction (or the channel 
dimension reduction) vanishes for sufficiently large K ', we use the logarithmic identity 


log c (a±b) = log c a + log c 



( 52 ) 


where a and b are nonnegative. Then, we get 

G 

Y, r log log K' = M log log K + o(l) 

9=1 

for large K'. This proves the achievability. 








39 


Appendix C 
Proof of Theorem [2] 

The proof begins with the dual MAC in ([6]) divided by M 


G 


M 


9=1 


£ log / + fCpw g s e w"hf 


G G 

1 E >°s I-A 1 +w," I + 4 E log i A s 


M 


(53) 
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where the equality is given by ( |4T] ) and the assumptions. 

The following lemma shows a useful asymptotic behavior of the central Wishart matrix. 

Lemma 1. For m large with the ratio r] — — fixed, the central Wishart matrix WW H with W 
the m x n matrix, where n>m, shows the asymptotic behavior 
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Proof: The proof of (54) can be immediately given by applying 
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and by using the fact that f>(k) behaves as 
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For p < 1 (i.e., r < K') at high SNR (P), taking expectation on the first term in the right-hand 
side (RHS) of (|53]), we have the upper bound 
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where (a) follows from (42) and (b) follows from (54) in Lemma [Tj From (45) and (54), we 
also get the lower bound 
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The second term in the RHS of ( [53] ) can be bounded by using 
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where we used tr (R g ) = tr(A g ) = M and the geometric-arithmetic mean inequality \A\ < 


Tr(A) 

< a 


)“ with a being the rank of an n x n matrix A. Combining (57) - (59), we obtain (18). 
For /r > 1 (i.e., r > K') and large K', similar to the above steps with 
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where the last inequality follows from (47), we can obtain ( [T9] ) by using (47), (48) and (54). 
The remaining details are omitted for the sake of the compactness of this paper. 
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